NSF PAR Search | NSF Public Access Repository

A statistical framework of watermarks for large language models: Pivot, detection efficiency and optimal rules

https://doi.org/10.1214/24-AOS2468

Li, Xiang; Ruan, Feng; Wang, Huiyuan; Long, Qi; Su, Weijie J (February 2025, The Annals of Statistics)

Full Text Available
Generate, Then Retrieve: Addressing Missing Modalities in Multimodal Learning via Generative AI and MoE

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Ding, Ying; Long, Qi; Chen, Tianlong (March 2025, aUpA5gulZ4)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to retrieve embeddings for missing modalities, two main bottlenecks remain: (1) the need to consider both intra- and inter-modal context, and (2) the cost of embedding selection, where embeddings often lack modality-specific knowledge. To address this, the authors propose MoE-Retriever, a novel framework inspired by Sparse Mixture of Experts (SMoE). MoE-Retriever defines a supporting group for intra-modal inputs—samples that commonly lack the target modality—by selecting samples with complementary modality combinations for the target modality. This group is integrated with inter-modal inputs from different modalities of the same sample, establishing both intra- and inter-modal contexts. These inputs are processed by Multi-Head Attention to generate context-aware embeddings, which serve as inputs to the SMoE Router that automatically selects the most relevant experts (embedding candidates). Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
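The routing step described above (a router scores candidate experts and keeps the most relevant ones) can be illustrated with a minimal sketch. This is not the authors' implementation: the dot-product scoring, the names `route_top_k` and `retrieve`, and the toy vectors are all assumptions made for illustration, and the Multi-Head Attention stage is omitted, so `query` simply stands in for a context-aware embedding.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(query, expert_keys, k=1):
    """Score every expert against the query and keep the k best.

    Hypothetical scoring rule: a plain dot product between the query and
    one key vector per expert. Returns (expert_index, weight) pairs whose
    weights are renormalised over the selected experts only.
    """
    logits = [sum(q * w for q, w in zip(query, key)) for key in expert_keys]
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    weights = softmax([logits[i] for i in top])
    return list(zip(top, weights))


def retrieve(query, expert_keys, expert_embeddings, k=1):
    """Return the weighted sum of the selected experts' embedding candidates."""
    dim = len(expert_embeddings[0])
    out = [0.0] * dim
    for idx, weight in route_top_k(query, expert_keys, k):
        for d in range(dim):
            out[d] += weight * expert_embeddings[idx][d]
    return out
```

With `k=1` the sketch reduces to picking a single embedding candidate; larger `k` blends several candidates, which is the usual sparse-routing trade-off between specialization and smoothing.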
Full Text Available
Sparse MoE as a New Retriever: Addressing Missing Modality Problem in Incomplete Multimodal Data

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Long, Qi; Chen, Tianlong (February 2025, ICLR 2025 https://openreview.net/forum?id=j9DbobO0mY)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to effectively retrieve embeddings for missing modalities, two main bottlenecks remain: the consideration of both intra- and inter-modal context, and the cost of embedding selection, where embeddings often lack modality-specific knowledge. In response, we propose MoE-Retriever, a novel framework inspired by the design principles of Sparse Mixture of Experts (SMoE). First, MoE-Retriever samples the relevant data from modality combinations, using a so-called supporting group to construct intra-modal inputs while incorporating inter-modal inputs. These inputs are then processed by Multi-Head Attention, after which the SMoE Router automatically selects the most relevant expert, i.e., the embedding candidate to be retrieved. Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
Full Text Available
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Xu, Sirui; Li, Dongting; Zhang, Yucheng; Xu, Xiyan; Long, Qi; Wang, Ziyin; Lu, Yunzhi; Dong, Shuchang; Jiang, Hezi; Gupta, Akshat; et al (June 2025, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
SDoH-GPT: using large language models to extract social determinants of health

https://doi.org/10.1093/jamia/ocaf094

Consoli, Bernardo; Wang, Haoyang; Wu, Xizhi; Wang, Song; Zhao, Xinyu; Wang, Yanshan; Rousseau, Justin; Hartvigsen, Tom; Shen, Li; Wu, Huanmei; et al (June 2025, Journal of the American Medical Informatics Association)

Abstract. Objective: Extracting social determinants of health (SDoHs) from medical notes depends heavily on labor-intensive annotations, which are typically task-specific, hampering reusability and limiting sharing. Here, we introduce SDoH-GPT, a novel framework leveraging few-shot learning large language models (LLMs) to automate the extraction of SDoH from unstructured text, aiming to improve both efficiency and generalizability. Materials and Methods: SDoH-GPT combines few-shot learning LLM methods, which extract SDoH from medical notes, with XGBoost classifiers, which are trained on the annotations generated by the LLM methods and then continue the SDoH classification. This combination exploits the strength of LLMs as few-shot learners and the efficiency of XGBoost when the training dataset is sufficient. Therefore, SDoH-GPT can extract SDoH without relying on extensive medical annotations or costly human intervention. Results: Our approach achieved tenfold and twentyfold reductions in time and cost, respectively, and superior consistency with human annotators, measured by a Cohen's kappa of up to 0.92. The combination of LLM and XGBoost can ensure high accuracy and computational efficiency while consistently maintaining AUROC scores of 0.90 or higher. Discussion: This study has verified SDoH-GPT on three datasets and highlights the potential of leveraging LLM and XGBoost to revolutionize medical note classification, demonstrating its capability to achieve highly accurate classifications with significantly reduced time and cost. Conclusion: The key contribution of this study is the integration of LLM with XGBoost, which enables cost-effective and high-quality annotations of SDoH. This research sets the stage for SDoH to become more accessible, scalable, and impactful in driving future healthcare solutions.
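The agreement metric cited in the results, Cohen's kappa, compares observed agreement between two annotators with the agreement expected by chance. The sketch below shows how such a score could be computed between human labels and LLM-generated labels; the function name `cohens_kappa` and the input format are assumptions for illustration, not the study's evaluation code.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of categorical labels.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    rate and p_e is the agreement expected if the two annotators labelled
    independently with their own marginal label frequencies.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout; treat as full agreement.
        return 1.0
    return (observed - expected) / (1.0 - expected)
```

A kappa of 1 means perfect agreement and 0 means agreement no better than chance, which is why it is a stricter measure than raw accuracy when label classes are imbalanced.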
Full Text Available
Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Xiao, Jiancong; Sun, Ruoyu; Long, Qi; Su, Weijie (July 2024, COLT 2024)

Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies by Khim and Loh (2018); Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfactory bound remains an elusive goal. Existing works on DNNs either apply to a surrogate loss instead of the robust loss or yield bounds that are notably looser compared to their standard counterparts. In the latter case, the bounds have a higher dependency on the width m of the DNNs or the dimension d of the data, with an extra factor of at least O(√m) or O(√d). This paper presents upper bounds for adversarial Rademacher complexity of DNNs that match the best-known upper bounds in standard settings, as established in the work of Bartlett et al. (2017), with the dependency on width and dimension being O(ln(dm)). The central challenge addressed is calculating the covering number of adversarial function classes. We aim to construct a new cover that possesses two properties: 1) compatibility with adversarial examples, and 2) precision comparable to covers used in standard settings. To this end, we introduce a new variant of covering number called the uniform covering number, specifically designed and proven to reconcile these two properties. Consequently, our method effectively bridges the gap between Rademacher complexity in robust and standard generalization.
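For readers unfamiliar with the quantities named above, the standard textbook definitions can be written as follows. The notation here (sample $S = \{z_1, \dots, z_n\}$, loss $\ell$, perturbation radius $\epsilon$, norm index $p$) is assumed for illustration and may differ from the notation used in the paper itself.

```latex
% Empirical Rademacher complexity of a function class F on a sample S
\hat{\mathfrak{R}}_S(\mathcal{F})
  = \mathbb{E}_{\sigma}\!\left[\, \sup_{f \in \mathcal{F}} \frac{1}{n} \sum_{i=1}^{n} \sigma_i f(z_i) \right],
  \qquad \sigma_1, \dots, \sigma_n \ \text{i.i.d. uniform on } \{-1, +1\}

% Adversarial (robust) loss class induced by F, whose complexity is being bounded
\tilde{\mathcal{F}}
  = \left\{ (x, y) \mapsto \max_{\|x' - x\|_p \le \epsilon} \ell\big(f(x'), y\big) \;:\; f \in \mathcal{F} \right\}
```

The inner maximization over perturbed inputs $x'$ is what makes covering the adversarial class harder than covering $\mathcal{F}$ itself.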
Full Text Available
GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing

https://doi.org/10.18653/v1/2025.naacl-long.287

Duan, Jinhao; Zhao, Xinyu; Zhang, Zhuoxuan; Ko, Eunhye Grace; Boddy, Lily; Wang, Chenan; Li, Tianhao; Rasgon, Alexander; Hong, Junyuan; Lee, Min Kyung; et al (January 2025, Association for Computational Linguistics)

Full Text Available
Fair canonical correlation analysis

Zhou, Zhuoping; Ataee-Tarzanagh, Davoud; Hou, Bojian; Tong, Boning; Xu, Jia; Feng, Yanbo; Long, Qi; Shen, Li (December 2023, NeurIPS’23: 37th Conference on Neural Information Processing Systems)

Full Text Available
Fairness-aware class imbalanced learning on multiple subgroups

Tarzanagh, Davoud Ataee; Hou, Bojian; Tong, Boning; Long, Qi; Shen, Li (July 2023, UAI’23: The 39th Conference on Uncertainty in Artificial Intelligence)

Full Text Available
Multi-Group Tensor Canonical Correlation Analysis

https://doi.org/10.1145/3584371.3612962

Zhou, Zhuoping; Tong, Boning; Ataee-Tarzanagh, Davoud; Hou, Bo-Jian; Saykin, Andrew; Long, Qi; Shen, Li (September 2023, ACM)

Full Text Available
